Tweedie's Formula and Selection Bias.

نویسنده

  • Bradley Efron
چکیده

We suppose that the statistician observes some large number of estimates z(i), each with its own unobserved expectation parameter μ(i). The largest few of the z(i)'s are likely to substantially overestimate their corresponding μ(i)'s, this being an example of selection bias, or regression to the mean. Tweedie's formula, first reported by Robbins in 1956, offers a simple empirical Bayes approach for correcting selection bias. This paper investigates its merits and limitations. In addition to the methodology, Tweedie's formula raises more general questions concerning empirical Bayes theory, discussed here as "relevance" and "empirical Bayes information." There is a close connection between applications of the formula and James-Stein estimation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficacy of cognitive-behavioural therapy and other psychological treatments for adult depression: meta-analytic study of publication bias.

BACKGROUND It is not clear whether the effects of cognitive-behavioural therapy and other psychotherapies have been overestimated because of publication bias. AIMS To examine indicators of publication bias in randomised controlled trials of psychotherapy for adult depression. METHOD We examined effect sizes of 117 trials with 175 comparisons between psychotherapy and control conditions. As ...

متن کامل

Model Selection in Classification: the Swapping Method

In this article, the bias of the empirical error rate in supervised classification is studied. The exact formula and a robust estimator of the bias are given. From these results, we propose a new penalized criterion to perform model selection in classification. Applications to simulated and real data are presented.

متن کامل

Estimating Gene Expression and Codon-Specific Translational Efficiencies, Mutation Biases, and Selection Coefficients from Genomic Data Alone‡

Extracting biologically meaningful information from the continuing flood of genomic data is a major challenge in the life sciences. Codon usage bias (CUB) is a general feature of most genomes and is thought to reflect the effects of both natural selection for efficient translation and mutation bias. Here we present a mechanistically interpretable, Bayesian model (ribosome overhead costs Stochas...

متن کامل

The Accuracy and Bias of Single-Step Genomic Prediction for Populations Under Selection

In single-step analyses, missing genotypes are explicitly or implicitly imputed, and this requires centering the observed genotypes using the means of the unselected founders. If genotypes are only available for selected individuals, centering on the unselected founder mean is not straightforward. Here, computer simulation is used to study an alternative analysis that does not require centering...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of the American Statistical Association

دوره 106 496  شماره 

صفحات  -

تاریخ انتشار 2011